Blar i AURA på forfatter "Brådland, Terje"
-
Empirical Evaluation of the Bayesian Learning Automaton Family
Brådland, Terje; Norheim, Thomas (Master thesis, 2009)The two-armed bandit problem is a classical optimization problem where a player sequentially selects and pulls one of two arms attached to a gambling machine, and each arm pull results in either a reward or penalty to ...